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1.  Introduction 


by 

S.  Zacks 


Let 


...,X 


be  i.i.d.  random  variables  having  a normal  distribu- 


tion N(n,a^),  -°°  < p < <*>,  0 < a < where  both  p and  o are  un- 

2 

known.  Consider  the  problem  of  estimating  the  distribution  variance  a . 

— 2 2 
Let  Xn,  Sn  be  the  sample  mean  and  the  sample  variance,  respectively.  Sn 

is  an  equi variant  estimator,  with  respect  to  the  group  S of  real  affine 

2 

transformations.  It  is  well  known  that  S is  inadmissible  for  the  squared- 

n 

2 

error  loss  function.  Moreover,  all  equivariant  estimators  of  a are 

inadmissible  (see  Zacks  [3;  pp.  364]),  this  is  due  to  the  fact  that  Xn  and 

2 — 2 
S are  independent  and  X has  also  some  information  on  o , that  can 
n n 

be  utilized  to  reduce  the  mean- squared- error  (MSE)  of  the  variance  estima- 

2 

tor.  Bayes  estimators  of  a , with  respect  to  the  squared-error  loss,  for 
any  prior  distributions  having  positive  p.d.f.  for  all  points  in  the  para- 
meter space,  are  admissible  estimators  (see  Zacks  [3;  pp.  365]).  The  ques- 
tion is  whether  such  admissible  Bayes  estimators  are  substantially  more 

2 n- 1 2 

efficient  than  the  minimum-MSE  equivariant  estimator  o„  = —77  S . In  the 

£j  n*rx  n 

2 

normal  case  the  proper  Bayes  estimators  of  a have  more  complicated  form 

*2 

than  Sp  and  sometimes  a computer  is  needed  for  their  application.  However, 

£j 

today  the  need  for  using  a computer  is  not  an  obstacle.  The  justification  for 
using  a complicated  estimator  is  only  in  substantial  improvement  of  effi- 
ciency. Box  and  Tiao  [1]  and  Zellner  [5]  provide  formulae  of  formal  Bayes 
estimators  of  o , using  the  improper  Jeffery*  s prior  H(p,  a)  = d^id a/a. 


‘on 

on 


! □ □ 


The  corresponding  formal  Bayes  estimator  is  however  equivariant  and  is  not 

a2 

better  than  3_.  DeGroot  [2]  presents  a proper  Bayes  estimator  which  is 

£j 

admissible.  This  Bayes  estimator  is  derived  for  the  prior  conjugate  dis- 

2 

tributions  so  that,  given  a , the  conditional  prior  distribution  of  p is 

2 2 2 
normal  N(p,  ho)  and  the  prior  marginal  distribution  of  l/2a  is  the 

gamma  distribution  GammaC^, v),  where  i|r-1  is  the  scale  parameter.  The 

Bayes  estimator  corresponding  to  this  prior  model  is 


(1.1) 


a2 

°BD 


* + (n-l)S2  + n(Xn-p)2/(l+nH2) 

n + 2v  - 2 


In  the  present  paper  we  develop  the  Bayes  estimator  suggested  by  Zacks 

[3;  pp.  366],  assuming  conjugate  distributions  of  independent  prior  para- 

2 2 
meters  p and  a • More  specifically,  we  assume  that  p ~N(p,D  ) and 

2 

l/2o  ~ Gamma(\(r#v)  . Although  the  difference  between  DeGroot’ s model  and 

the  present  one  seems  small,  the  two  models  are  actually  quite  different, 

2 

since  in  the  former  the  prior  normal  distribution  of  p depends  on  o • 

In  the  present  model  of  priorly  independent  parameters  we  obtain  a substan- 

2 

tially  more  complicated  Bayes  estimator,  designated  by  • In  Sec- 

2 

tion  3 we  compare  the  relative  efficiencies  of  the  three  estimators  S-  , 

JL 

2 2 

oBD  and  Sgj  . The  Bayesian  framework  developed  here  is  applied  in  Sec- 
tion  4 for  the  derivation  of  the  Bayes  estimator  of  the  variance  of  a 

finite  population,  which  is  discussed  by  Zacks  and  Solomon  [4]. 


2. 


*2 

Derivation  of  the  Bayes  Estimator  Sgj 
2 

Let  8 = l/2o  . The  likelihood  function  of  (p, e)  given  the  minimal 

(X  ,S2)  is 
' n*  n 


sufficient  statistic 


-3- 


n 

(2*1)  L(^, e|x  , S^)  = 02  exp{-n0(X  -p)2  - (n-l)0S2}, 

n n n n 

for  < p < • and  0 < 0 < ®.  it  follows  that  the  posterior  p.d.  f of 

(4#  d)t  given  (Xn,S2),  for  the  independent  conjugate  priors  is 

— + v 1 

(2.2)  k(n,0lx n,S2)  « 02  V exp{-n0(X  -p)2  - -i;(p-p)2 

n 2D 

2 

- e[(n-l)sn  + t]),  -®  < p < ®,  0 < 0 < ®. 


It  is  easy  to  verify  that 


(2.3) 


/ exp{-n0(X  -p)2  - ~^(p-p)2)c4i  = 
n 2D2 


= v^2tt"  D(l  + 2n0D2)"^  exp( ^ =(X  - p)2} 

1 + 2n  0 D n 


2 __  o 

Hence,  the  posterior  expectation  of  0 = l/20,  given  (Xn,S^),  is 


(2.4)  E{a2|Xn,S2)  = 

J 02  (l+2n0D2)'®  expf--  -^0-^(x  -4)2-  0((n-l)s2+^)\d0 

= j.JO L l+2n0D2  n n J 

J 02  (l+2n0D2)-^  exp  f- — — — r(X  -p)2-  0(  (n-l)s2+t(f)\d0 

0 V.  l+2n0D^  n n J 


p _ 

This  is  the  Bayes  estimator  a . By  making  the  transformation  X = 2n0D2 
we  reduce  (2.4)  to 


(2.5) 


(n-l)s2  + t 
n + 2v  - 2 


M1(X,|+  v-1,52) 
M^X,  f + v,  52)  ' 


-4- 


where 


(2.6) 


X = ( (n-l)S^  + *)/2nD2,  62  = (Xn  - m)2/d2 


t i for  each  r = 1,2,...  and  X ~Gamma(X,v) 


(2.7) 


Mr(X,v,62)  = Ef(l+X)  2 exp{  ^ })  . 


2 

The  function  Mr(x, v, 5 ) is  determined  in  the  following  manner.  We  make 
first  the  expansion 

(2.8)  «r(w2, - s,  i?  • 

Let  p(j|x)  denote  the  p.d.f.  of  the  Poisson  with  mean  X.  Then, 


(2.9) 


where 


2 

M (X,v,5  ) = Sp(j|^-)R  (r,  X,v,62), 
J=0  2 3 


o -t  ->V  -&2/2  00  „J+V-1 

(2.10)  R (r,X,v,62)  = (-l)J  A f--  f - A-  e"XX  dx 

3 T(v)  o (l+x)J  /2 


J+v-1 


. ,v  8/2  + X ® / . \ 

- (-i)J  J ^ 


T(v)  1 yj+r/2 


e-Xy  dy 


Suppose  that  v is  an  integer  and  r = 2hh-1,  then 

2 / 

p -i  ,v  8 /2+X  J+v-1  ,./,x  ,\ 

(2.11)  R.(r,X,v,52)  = (-l)J  IS. S (-l)1^  y-1^ 

J r(v)  i=o  v W 

where  generally 

(2.12)  E^(X)  = J y^  e_Xydy,  L = 0,±1,±2,. 


'v-l-m-s 


e vdy,  l = 0,±1,±2, . . . . 


These  exponential  integrals  are  determined  by  the  recursive  formula 


l 


-5- 


(e"X  + (A-i)EjJ_|(X))/x  , ife  1 

(2.13)  E^i( X)  = 

^5(e’X’XE.^(X))  ' As-1 

where 

(2.14)  E_^(x)  = zjl  (1  - *{SZ\)), 

and  $(z)  is  the  standard  normal  integral.  If  v is  not  an  integer  other 
expansions  can  he  attempted  or  the  value  of  the  M- function  can  he  deter- 
mined approximately  by  linear  interpolation  between  the  values  of  the  M- func- 
tion corresponding  to  the  two  integers  adjacent  to  v.  In  the  Appendix  we 
provide  a FORTRAN  subroutine  function  to  compute  Mr(\,v,  6)  for  integer 
values  of  v . 


3.  Relative  Efficiency  Comparisons 


*2  *2 

In  the  present  section  we  compare  the  three  estimators  a„  , and 

Cj  dJJ 


*2 


Sgj  with  respect  to  their  relative  efficiency.  For  the  purpose  of  com- 
paring the  Bayes  estimators  with  the  equivariant  ones  we  define  the  rela- 

2 

tive  efficiency  of  an  estimator  o as  the  ratio  of  its  M3E  to  that  of 

* 2 2 

Sg.  More  specifically,  the  relative  efficiency  function  of  5 is  defined 

as 


(3.1)  RE(o2,ud)  = / E„{  (a  - a2)2), 

2a 

2 2 
where  uo  = (4, a ).  We  derive  first  the  relative  efficiency  of  SgD.  Notice 

that  n(X  -p)2  ~ o2x^[l>  ],  where  x2[ljX]  designates  the  non- 

n 2o 

central  chi- squared  with  1 degree  of  freedom  and  parameter  of 
trality  X.  Let  n7  = n+2v-2  then 


non-cen- 


-6- 


2 2r,  _ n(u-L 


(3.2) 


2 2 , rr  v Tl  • iiAifTEi-l 

« °%[-H  a?  1 

" ■>'  n'(ltriK2)  ' 


2 2 

with  XlM  and  independent.  Hence, 

(3.3,  e(5|d,  . f (x . ♦ 4} 


(3.4) 


r2  . 2qU  1 + 4n(H^) 

BD  (n')2  (l  + n»2)2 


Let  £ = (n-^)/a  and  p = ^ +-n^2  - (2v-l)  + -4;  then  the  relative  effi- 
2 + n,t  2 2 ° " 

ciency  of  depends  only  on  £ , -t;  , h , v and  n and  is  given  by: 

o 

/■»  r\  t^t-i / * 2 ,.2  if  \ n+2v— 2 f , 2v— 1 

(3*6)  RE^°BDiS>'  2>r#v^  “ n+1  1 " n+2v-2 

a L 


1 + 4nC 


+ (n+2v>-2)  (l+nn2)  + ^2v-2)J  ‘ 

a2 

The  estimator  is  considerably  more  complicated  and  no  explicit 

formula  of  its  MSE  can  be  derived.  We  can  compute  its  MSE,  however,  numer- 

ically  in  the  following  manner.  Since  n(Xn-n)  ~o  \ (1}X]  with 
. n r2 

X = g C we  can  write 


f J-  + 2W 

(3.7)  Etcagj-C2)2)^  o4e|  5 • 

M1^J2  + 2W3)  /2nH2,|  + v-l,  2W2(j)/nn2)  “|2>1 

M1((J2  + 2Wl)  / 2nn2  , | + v , 2W2(j)/nH2)  _ J 

''o  ' 

where  W^,W^(j)  are  independent,  ~ Gamma  Wg(j)  ~ Gamma(l,g+j) 

and  J is  a Poisson  r.v.  with  mean  X.  Let  G(xjv)  be  the  c.d.f  of 


-7- 


Gamma(l,v)  at  x,  let  G-1(plv)  be  the  p-th  fractile  of  Gamma  (l,v).  De- 
fine I1  = G_1(.99 1— I2(j)  = G“1(.005|&+j)  and  f2(j)  = G-1(.995|§+j). 

The  risk  function  (3.7)  is  determined  by  computing  first  the  conditional 
expectation  given  J numerically  over  the  range  (0, ^)x(§_g( J), §2( J)). 

The  conditional  expectations  are  then  averaged  with  respect  to  the  Poisson 
distribution  with  mean  The  range  of  integration  for  each  J is  par- 

titioned into  MXM  rectangles.  Let  §^(i)  = il-j/M  for  i = 0,1,  ...,M 
and  let  5g(j,i)  = J 2 (J)  + i(fg(j)  - (j))/M,  i = 0,...,M.  Furthermore, 

let  ? (J,i)  = (§2(j,i)  + C2(J,i-l))/2,  i = l,...,M,  and  let  J*=  Integer  part  of 

* 2 

(x + 4 ) . Then,  a numerical  approximation  to  the  relative  efficiency  of  is 


given  by 

(3.8)  RE(5gIiC,4»M»v)  = -JTl’ 

a 


S p(g1\) 
j=0 


^4+2?1(i1) 


M M 
2 2 

V1  V1 


n 1 1 ^ ^4 + 24(il})  / Zn*4 + v> 2?2(j' ^Vhh2) 


- 1 


(G(li(ii)  |^)  “ °(51(i1-l)  l-2^))  ‘ 


n-1 


(G(S2(j,i2)  U + J)  - G(52(j,i2-1)  | i + j)) 


The  functions  G(x}^+j),  j = 0,1,...  can  be  computed  recursively  accord- 
ing to  the  formula 


(3.9)  G(xl^-j)  = 


xj_*  e"X  + G(x|j-i),  j * 1 


2$(a/2x)  - 1 


, i = o 


-8- 


in  1 

Hie  function  G(x(-^)  is  computed  similarly  if  n is  even.  If  n is 
odd  we  apply  a similar  recursion  with  G(x|l)  = l-e”x.  In  Table  1 we  pro- 

*2  a2 

vide  values  of  the  RE  functions  of  and  aBT  for  n = 10. 

DU  -D-L 

— = 2,6,10}  v = 2,  k = 2 and  Q = 0,  .5  and  1.  We  see  that  for  Q = 0 

o 

is  more  efficient  than  aZ.  Furthermore,  for  Q = 0 and  -4-  small  ofL  con- 
BI  BD  02  BI 

*2 

siderably  more  efficient  than  the  best  equivariant  estimator  Sg.  However, 

*2  2 
when  C ^ • 5 Sgg  is  generally  more  efficient  than  Sg^.,  but  both  esti- 

2 2 

mators  may  be  less  efficient  than  Sg  (recall  that  Sg  is  minimax!). 


2 a2 

Table  1.  Relative  Efficiency  Values  of  6gD  and  Og^ 
for  samples  of  size  n = 10. 


i/cZ 

V 

H 

Q 

BE(4> 

RE(3^) 

Z’  M 

i—  a J 

2.  0 

2.  0 

Ci.  0 

1 . 373 

1 . 34  0 

•3  m 0 

2.  0 

2 . 0 

0.  0 

0.963 

a 3 ; 7 

1 0 . 0 

2.  0 

2.  0 

CI.  Cl 

0.  333 

0. 939 

Zf  ;"| 

2.  Ci 

2.  Ci 

0.5 

1 . 35 1 

0 . 4 6 7 

3.  0 

2.  Ci 

1'.  fl 

Ci . 5 

0. 933 

0.  935 

1 Cl.  0 

2.  0 

2.  Ci 

0 . 5 

0.  331 

0 . 273 

1.  0 

2.  Ci 

2.  0 

1 . d 

1 . 275 

0.  072 

6.  Ci 

2.  0 

2.  0 

1 . 0 

0.  353 

0.  149 

1 Ci.  Ci 

2.  0 

2.  0 

1 . Cl 

0 . 33  0 

0.  04  3 

4.  Estimating  the  Variance  of  a Finite  Population 

Let  x1,...,xN  be  the  values  of  N units  in  a finite  population.  We 

2 ^ 2 

consider  the  problem  of  estimating  the  variance  ^ .S  (x.-n)  , 

^ N 1 

U = ■jj  EXl  on  the  basis  of  a sample  of  n values  X^, ...,Xn  chosen  from 

that  population.  Zacks  and  Solomon  [4]  presented  the  form  of  Bayes  estima- 


tors of  We  derive  here  the  Bayes  estimator  for  the  squared- error  loss 


-9- 


when  the  model  is  that  x^, ...,x^  are  conditionally  i.i.d.  N(p,c  ) 
and  that  p ~ N(p,D  ),  l/2a  ~ Gamma(i|r,  v) . This  model  actually  implies 
that  the  variates  in  the  population  are  exchangeable  random  variables  having 
a distribution  which  is  a mxxtrue  of  normal  distributions.  Without  loss 


of  generality  one  can  assume  that  the  sample  consists  of  the  first  n 


variates  x, 


1'  ‘ - 


Let  x be  the  sample  mean 


. *2  1 V / - \2 

and  a = — L>  (x.-x  ) 


i=l 


2 _* 

the  sample  (classical)  estimate  of  o„.  Let  x^  be  the  mean  of  the 

w iM-n  w 

2 1 * ' ^ 2 

population  variates  which  are  not  in  the  sample  and  n = ^~  ^ (x^-x^  • 

""  i=n+l 

It  is  shown  in  [4]  that 


(4.1) 


/,  m 2 , n/n  nw—  —*  \2 

o«  =u  O.  + „ + M(1-TT)(xn-xN_n^  • 


2 _ n a2 
JN  " H cn 


N N-n  N' 


We  derive  here  the  Bayes  estimator  S?  = E{a^|x  ) according  to  the  above 


°B 


N‘~n 


model.  One  should  determine  the  posterior  expectations  of  t and 

N-n 


of  (xn  - x^  given  the  sample  values  x^  = (x^,  . ..,xn).  Notice  first 

2 

that  since  x^, ...,x^  are  conditionally  i.i.d,  given  p,o 


(4.2) 


E(t  |x  ,o  ) = 

1 N-n1'**!7  N-n 


N-n-1  2 

a 


E{(x  -x*  )2|x  ,a2,p)  = tt—  + (p-x  )2 

n N-n  W7  7 N-n  n 


Hence, 


5B  = Et4li»>  ■ 

| % * a-#Kl-i)EtoEl£„}  * 2(l.S)E{„-x„)2|in) 


(4.3) 


-10- 


We  have  seen  in  Section  2 that 


(U.U) 


E{o2  lx}  = E(o2  1 x ,a2} 
~n  1 n7  n 


no^  + | + v- 1,52) 

n+2v-l  Ml(X,|+v,62) 


where  X - (na^  + \|()/2nD  and  6 = (x  - (a)  /D2.  To  derive  the  posterior 

__  g 

expectation  of  (p.  - x ) we  write  first  (see  Zellner  [5;  pp.  22]) 


(^•5) 


E{(p-xn)2|a2,xn, S2}  = ^ W + (xn-p)2(l-w)2. 


2 2 2 

where  W = D /(D  + a /n).  Finally,  since  ~ W = D2(l  + 2n0D2)-1  and 
(l-W)  = (l  + 2n0D2)  2 we  obtain 

(4-6)  E((p-x  )2|x  ,$2)  = D2E{(l+2n0D2)~1|x  ,£2} 

n n n » 1 n n 

+ (^n-4)2E((l+2n0D2)"2lxn,a2) 


and  the  Bayes  estimator  of  is 

? „ ? „ . nc2  + i|r  M.  (X.-2  + v-  1,52) 

(M)  n v’2 


N/v  N'  n+2v-2 


V^i + v 76 ) 


n,,  n^2  ^(X»2  + V75  ^+5  M5(x,-^  + v,S  ) 


♦Sfr-i* 


^(1,1+ v,52) 


Prior  risk  comparisons  of  the  estimator  Sg  with  the  classical  estimator 
on  are  given  in  [4]. 


REFERENCES 


Box,  G.E.P.  and  Tiao,  G.C.,  Bayesian  Inference  In  Statistical  Analysis, 
Ad  1 son-Wesley,  Reading,  Mass.,  1973. 

DeGroot,  M.H.,  Optimal  Statistical  Decisions,  McGraw-Hill,  New  York, 
1970. 

Zacks,  S.,  The  Theory  of  Statistical  Inference,  John  Wiley,  New  York, 

1971- 

Zacks,  S.  and  Solomon,  H.,  Bayes  and  Equivariant  Estimators  of  the 
Variance  of  a Finite  Population.  Tech.  Report  No.  35>  ONR 
Project  NR  042-276,  Department  of  Mathematics  and  Statistics, 

Case  Western  Reserve  University,  1978. 

Zellner,  A. , An  Introduction  To  Bayesian  Inference  In  Econometrics, 

John  Wiley,  New  York,  1971. 


1-10-79/ts 


r- 


w 


-12- 


. 2. 

Appendix:  FORTRAN  Program  for  the  Computation  of  M (X»v»5  ) 

The  subroutine  function  is  called  BM(lR, AL, V, DS)  where  IR  «-  r, 

2 

AL  «-  X,  v ♦-  v,  DS  «-  & . The  subroutine  function  consists  of  three  main 
parts  (lines  10-170$  180-U30  and  4U0-680).  This  is  then  supported  by 
FUNCTION  C(l,K)  (lines  690-850)  which  computed  the  combinatorial  function 
FUNCTION  P0S(j,AL)  (lines  860-1070)  computes  the  Poisson  c.d.f.  with 
mean  A L \ at  J «-  j.  Finally,  FUNCTION  DNDX(x)  (lines  1080-1270)  com- 

putes the  standard  normal  integral  #(x) . In  addition,  the  gamma  function 
GAMF(W)  «-  T(w)  is  utilized  in  line  300.  This  function  was  computed  with 
the  computer  library  subroutine.  If  such  a routine  is  not  available  one 
should  supplement  a subroutine  FUNCTION  GAMF(w). 


A A 1 A 

f 1 1 n n x t 0 n b r-i  ( :i:  r . a i..  ? v n s > 

0002.0 

.JR  = I R 

00030 

Bi..  --AL 

000  A0 

w^y 

OOO.'iO 

ES-DS/2 

00020 

I UR- 1 N T ( W > 

0<K  70 

K:  ' JR-D/2 

00080 

AS*ES+4.*SGRT<ES> 

C 0 0 ? 0 

JS"!  I NT  ( 1T8 ) v 1 
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